On Computing the Largest Fraction of Missing Information for the EM Algorithm and the Worst Linear Function for Data Augmentation
نویسندگان
چکیده
We address the problem of computing the largest fraction of missing information for the EM algorithm and the worst linear function for data augmentation These are the largest eigenvalue and its associated eigenvector for the Jacobian of the EM operator at a maximum likelihood estimate which are important for assessing convergence in iterative simulation An estimate of the largest fraction of missing information is available from the EM iterates this is often adequate since only a few gures of accuracy are needed In some instances the EM iteration also gives an estimate of the worst linear function We show that improved estimates can be essential for proper inference In order to obtain improved estimates e ciently we use the power method for eigen computation Unlike eigenvalue decomposition the power method computes only the largest eigenvalue and eigenvector of a matrix it can take advantage of a good eigenvector estimate as an initial value and it can be terminated after only a few gures of accuracy are achieved Moreover the matrix products needed in the power method can be computed by extrapolation obviating the need to form the Jacobian of the EM operator We give results of simulation studies on multivariate normal data showing that this approach becomes more e cient as the data dimension increases than methods that use a nite di erence approximation to the Jacobian which is the only general purpose alternative available
منابع مشابه
An EM Algorithm for Estimating the Parameters of the Generalized Exponential Distribution under Unified Hybrid Censored Data
The unified hybrid censoring is a mixture of generalized Type-I and Type-II hybrid censoring schemes. This article presents the statistical inferences on Generalized Exponential Distribution parameters when the data are obtained from the unified hybrid censoring scheme. It is observed that the maximum likelihood estimators can not be derived in closed form. The EM algorithm for computing the ma...
متن کاملA Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملمقایسه روش الگوریتم EM و روشهای متداول جانهی دادههای گمشده: مطالعهروی پرسشنامه خوددرمانی بیماران دیابتی
Background and Objectives: Missing data is a big challenge in the research. According to the type of the study and of the variables, different ways have been proposed to work with these data. This study compared five popular imputation approaches in addressing missing data in the questionnaires. Methods: In this study, 500 questionnaires were used for self-medication in diabetic patients. Mi...
متن کاملAsymptotic algorithm for computing the sample variance of interval data
The problem of the sample variance computation for epistemic inter-val-valued data is, in general, NP-hard. Therefore, known efficient algorithms for computing variance require strong restrictions on admissible intervals like the no-subset property or heavy limitations on the number of possible intersections between intervals. A new asymptotic algorithm for computing the upper bound of the samp...
متن کاملStage Life Testing with Missing Stage Information - an EM-Algorithm Approach
We consider a stage life testing model and assume that the information at which levels the failures occurred is not available. In order to find estimates for the lifetime distribution parameters, we propose an EM-algorithm approach which interprets the lack of knowledge about the stages as missing information. Furthermore, we illustrate the implementation difficulties caused by an increasing nu...
متن کامل